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1. Introduction 



This review is motivated by a powerful and breathtaking development in "Additive 
Combinatorics" , the direction of study connected with the names of Timothy Gowers, 
Jean Bourgain, Terence Tao and Ben Green. 

The results and methods of Inverse Additive Number Theory have been used sub- 
stantially in this new field. 

Additive Combinatorics now looks towards algebra and computation. Results in 
these areas existed in Inverse Additive Number Theory and they will be discussed 
below. 

This paper is a continuation of the reviews [32] and [33]. 

2. The main theorem 
We consider a finite subset A of Z n (and often Z) with cardinality \A\ = k. We define 

2A = A + A = {x : x = a + b, a e A, b e A}. 
We say that A has the small doubling property if 

\2A\ < Ck, 

where C is a positive constant or a slowly increasing function. The number 

\2A\ 
k 

is called the doubling coefficient of A. 

We define the notion of additive isomorphism of A C Z n onto B C Z m as a bijection 
if : A — > B such that, for all a, b, c, d in A, we have 

<p(a) + <p{b) = <p{c) + <p(d) (1) 

if and only if 

a + b = c + d. 
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We write A ~ B and note that ~ is an equivalence relation. We name a d- dimensional 
parallelepiped the subset D C Z d 

D — {(x 1 ,x 2 ,. ■ ■ ,x d : Xi eZ, < Xi < h h hi>2, 1 <i < d)}, 

where 

\D\ = hih 2 . • • h d . 

Main Theorem. For all finite subsets A e Z /or which \2A\ < Ck, where C is a 
positive constant or C = C{k) is a slowly increasing function, there exist c = c(C) 
and a parallelepiped D with d < [C — 1] and \D\ < ck such that A C (p(D), where ip 
is an isomorphism. □ 

For a given A e Z, we consider the images y?(A) under all isomorphisms y? : A — )■ Z n , 
for all n > 1. The dimension of any y?(A) is the dimension of the smallest affine 
subspace containing it. The maximum dimension of (fi(A) for all such tp is called the 
dimension of A and denoted d(A). 

We will define the notion of volume of A. Let A e Z n , where n is the dimension 
of A. For each isomorphism tp : A — > Z n , let us take the convex hull of <p(A) and let 
be the number of integer points in it. Then we define 

V(A) = mm\(p(A)\. 

v> 

The proof of the Main Theorem was obtained gradually (see [14]-[19]). The final 
text of the proof may be found in [13] and Yuri Bilu's paper [2] . Somewhat different 
proofs were given by Ruzsa [41], [42], Mei-Chu Chang [11] and Sanders [43], with 
better estimates of volume. 

Now, some remarks on estimates of V(A), the volume of A. We can also estimate 
V(A) by the volume 

\D\ = hih 2 . . . h d 

of a rf-dimensional parallelepiped D with a minimal \D\ and A C tp(D), where d is the 
dimension of A and tp is an isomorphism. The set <p(D) C Z is called a generalized 
proper arithmetic progression of dimension d. 

Existing estimates were obtained with the help of <p(D), where <p is not an isomor- 
phism but a homomorphism giving a generalized arithmetic progression which is not 
proper. As a result, we obtain an estimate of the volume V(A) and dimension of A, 
which wait to be improved. Below, we will give a hypothetical best possible estimate. 

3. Hypothetical value of V(A) 

We begin with an example which does not cover all possible values of T = \2A\ but 
is very simple and representative. 
Let 

A = {0,1,..., m- \,2{m- l),4(m- 1), . . . , 2 c ~ 2 (m - 1)} 
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where, as usual, \A\ = k, and 

3 < m < k, (2) 
c = k + 2 - m, (3) 

so that 2 < c < k — 1. 

In this case we have d(A) = 1 and the values of V and T have a very simple form 
for c which is not very large: 

V = 2 c -\m - 1) + 1 = ^2 c (k + 1 - c) + 1 = ^2 C A; + 0(c2 c ) (4) 

and 

T = (k - 1) + (A; - 2) H h m + 2m - 1 = - - 1 + m (k - m) + 2m - 1 = 

2 

= fc " 1+ ^ + 2 ~ C (c-2) + 2(A: + 2-c)-l = 

, (c-l)(c-2) rt . nx , , c 2 + c-4 

= cA; - - 2 c - 2 - 1 = ck = 

2 v 1 2 

= ck + 0(c 2 ). (5) 

If c = 0(log 5 k), 5 < 1, then, for V and T, (4) and (5) give very simple expressions, 
sufficient for today's use. 

Now, we will give examples of the sets A, for which T takes all possible values. We 
will see, that the "net" of sets A, given earlier, represents the general situation in a 
very good way. 

For values m and c as in (2) and (3), let 

a m _i = m - 1 + b, (6) 

where 

0<b<m-3 = k-c-l, (7) 

and denote 

A — {0,1,..., m — 2, am-t, 2a m _i, 4a m _i, . . . , 2 c ~ 2 a m _i}, (8) 
where conditions (6) and (7) are valid. We get 

V = 2 c ~ 2 a m + 1 = l -2 c [m - 1 + b) + 1 = 

= l -2 c (k - c + 1 + b) + 1 = h c (k + b) + 0(c2 c ), (9) 

and 

T=(k-l) + (k-2) + --- + m + 2m-l + b = ck + b + 0(c 2 ). (10) 
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If d(A) > 1, we build the set A in the following manner. Let 

A = A d = {A'e 1 , e 2 , . . • , e d } (11) 

where A'CZ will be defined a bit later. 
For a given value T, we define 

\2A'\ = T - (k + (k - 1) + (k - 2) + • • • + (k - d + 2)). (12) 

We have 

k' = \A'\ — k — d+1. (13) 

For values k' and \2A'\ given in (12) and (13), we can build the set A 1 as in (8) and 
the set A d = A with the help of (11). 

In the case where the values of k and T are given, we can also get the corresponding 
values of V(A) as a function of k and T. It was done in [13, p. 37], where the hypothesis 
about the best estimate of V(A) was formulated. We may reproduce it now in a nicer 
form. 

Hypothesis. The inequality 

V(A) < l -2 c (k - c + 1 + b) + 1 (14) 

gives an estimate from above for V(A) for a given k and T which cannot be improved. 

If this hypothesis were proved to be true, the proof would be important not only 
as an improvement of an estimate of V(A) for a given doubling coefficient T, but 
also the range of values T would cover all possible values at the time when existing 
estimates make sense only for rather small values of T. 

The set A may, because of (11), be put in a parallelepiped D with edges 

h-i — V(A'), h 2 = h 3 = ...h d = 2 

and 

V(D) = 2 d - 1 V(A'). 

We may define the volume of a set A with the help of the volume of D D A for 
a minimal \D\ for such D (a proper ci-dimensional arithmetical progression) and 
formulate a hypothesis about minimal volume in a simpler though less exact way. 

4. On a family of extremal sets 

We would now like to go one step deeper into the study of the structure of A. We 
have used a concrete example of A with given values k and T for which the maximal 
value given in (14) is achieved. 

Let us ask about a list of such A. Here, as is usual in solving an inverse additive 
problem, see [32, p. 5], we study the structure of the set when its characteristic is 
equal to its extremal value. 
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As an example of such sets, let us build a family of sets {B} which are "friendly" 
to A defined in (8) in the following manner: 

\B\ = \A\ = k, 

B = {6o,6i,...,6 fc _i}. 

Define B(s) as a subset of B containing the first s elements of B: 

B(s) = {bo,bi, . . . ,6 s _i}. 

Let B{m) be one of the sets described in Theorem 1.9 of [13], and in the formulation 
given at the end of p. 11, we have 

b = 0, b m -i = a m -i = m — 1 + b, \2B(m) \ = 2m —1 + 6. 

Let p m be the number of such sets. Define the sets B(s) for m < s < k in the 
following manner: Let the sets for some value which is less than or equal to some 
chosen value of s be already known. Take B(s) which is one of them. Define 

sym(B(s)) = maxB(s) — B(s), 

and, for a chosen B(s), take two possibilities 

B'(s + 1) = {B(s) U {2 s - m+1 a m _ 1 }} (15) 

or 

B"(s + 1) = {symB'(s + 1)}. (16) 
Note that for s > m we have 

maxB(s) = 2 s ~ m a m _ l 

and so 

m&xBik) =2 fc ~ m a m _i = 1/4 * 2 c (k - c + 1 + b). (17) 

The number of sets in the family {B} is equal to 2 c ~ 2 p m . 

The values V and T, for each set B just built, are the same as in (9) and (10). 

Let us give a numerical example. Take k = 8, m = 6 and because of (3) we get 
c = 4. Further, take in a m _i = m — 1 + 6, b = 2 and 05 = 7. 

For sets {-8(6)} we obtain the following list: 

{B(6)} = {0, 1, 2, 3, 4, 7}, {0, 1, 2, 3, 5, 7}, {0, 3, 4, 5, 6, 7}, 

{0,2, 4, 5, 6, 7}, {0,2, 3, 4, 5, 7}. (18) 

Therefore, the number of these sets is = 5. 
From (9) and (10) we get 

T = 7 + 6 + 12-l + 2 = 26, (19) 
V = ^2 4 (6 - 1 + 2) + 1 = 29. (20) 
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Take, for example, the set {0,1,2,3,4,7} from {£>(6)}. Using (9) and (10), we 
get two sets {0, 1, 2, 3, 4, 7, 14} and {0, 7, 10, 11, 12, 13, 14} from {B(7)} and four sets 
{0,1,2,3,4,7,14,28}, {0,14,21,25,26,27,28}, {0,7,10,11,12,13,14,28} and 
{0, 14, 15, 16, 17, 18, 21, 28} from {B{8)}. 

As a result we will get 20 sets from {B(8)}. We gave a partial list of extremal sets 
which gives some idea of how to construct them. 

5. Partial case: T <4k -7 
Let us use the following lemma, see [13, p. 24]. 

Lemma. Let A C E n be a finite set of dimension n with \A\ = k. Then 

n(n + l) 

T> (n + l)k- V . 

We see that, if n = 2, then 

T>3A;-3 (21) 

and if n — 3 then 

T > Ak - 6. (22) 

From (21) and (22), we see that if T < 3A; — 4 then d(A) = 1. This case was studied 
in detail in [13, p. 11-14]. As to the case, 

3 A; - 3 < T < Ak - 7, 

it gives a good example of the Hypothesis for a very simple case. 
In this case, (8) takes the form 

A — 0,l,2,...,k — 3,k — 2 + b,2(k — 2 + b) 

where 

T = k - 1 + 2(k - 1) - 1 + b = 3k - 4 + b, 
< b < k - 3. 

Then, according to the hypothesis, A is part of a segment of length 2k — 3 + 2b or 
has dimension two. This case is much simpler than the general case (on the second 
page of this article); I had formulated it immediately after Theorem 1 was proved, 
but even in this simpler case, the problem is still open (for some progress, see Jin 
[38]). 
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6. Group theory 

Over the last few years, there has been exciting development in group theory (Tao [44], 
Breuillard and Green [7], [8], Green [36], Fisher, Katz and Peng [12], Hrushovski [37]) 
connected with sets with small doubling. I'm referring to the results in approximation 
groups. 

An approximation group is a finite subset of a group which, together with the 
property of small doubling, contains a unit and has a symmetry property. 
Here are some of my old results in this direction: 

Theorem I. Let G be a torsion-free group, K and M subsets of G, for which 

2 < \K\, \M\ 

and 

\KM\ = \K\ + |M| - 1, 

then K and M have the form 

K — {a,aq, . . . , aq 1 ^ 1 } (23) 

and 

M = {b,qb,...,q h ~ 1 b}, 

where 

a,b,qeG,q^l. 

And what about K = Ml 

Theorem II. // \K 2 \ = 2\K\ — 1 then K has the form (23) and either aq = qa or 
qaq = a holds. 

The proof is in [28]. 

The immediate problem here is to describe the structure of the subset A C G in a 
case when \ A 2 \ < 3k — 4 and the group G is torsion-free. In the study of approximate 
groups, the study of subsets of groups of special kinds take place, and this is a very 
natural and rewarding turn. If G is not torsion free then the doubling coefficient 
may be less than two. In this case, the structure of A was described in [20] for the 
doubling coefficient 8/5. What about bigger coefficients? 

7. All small subsets have small doubling 

Now, we will describe the study of group structure, which is different from the fast 
growing study of approximate groups, but which also uses the idea of small doubling 
as a main tool, and sometimes arrives at similar results. They show that some subsets 
of a group have the structure of a subgroup, a coset or a union of cosets for some 
normal subgroup. Let us translate the notion of isomorphic subsets to the notion 
of similarity of multiplication tables (Latin squares). We will now call an algebraic 
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operation a multiplication, and take a product ab instead of the sum a + b. We will 
enter he elements of the sets D and F into the following tables: 



a b c 
a A B C 



(24) 



and 



x 



x 

y 



(25) 



Here a,b,c, . . . in D and x,y,z, . . . in F. 

Entries in a multiplication table may be put in some freely chosen order. So, let us 
change the order of columns in (25), and correspondingly the order of lines in (25), 
in such a way that we will get the following table instead of (25): 

ip(a) ip(b) ip(c) 

1m (26) 
<p{c) 

As to notation inside the multiplication table (24), the elements of D 2 and D may 
be different, and we denote elements of D by lower-case Latin letters and elements of 
D 2 by upper-case ones, and we denote equal elements of D 2 by the same letter and 
different ones by different letters. And now, how is condition (1) in multiplicative 
form 

ab = cd ^ (p(a)ip(b) = ip(c)ip(d) 

formulated in the case of Latin squares? The condition ab = cd means the element 
of the table on the line with entry a and on column b is equal to the element with 
line c and column d. The condition ip(a)(p(b) = (p(c)ip(d) tells us that elements in the 
same places in the table (26) are equal and have the same notation. If the equality 
is not valid in table (25), it is not valid for corresponding places in table (26), as 
well. So, as formulated in [22, p. 141], let D and F be Latin squares of the same 
order; also, let 9 be a bijection of D onto F. Applying 9 to D, we have a new Latin 
square 9[D]. If F can be obtained from 9[D] by a permutation of rows and the same 
permutation of columns of 9[D], then D and F are said to be isomorphic. In paper [22] 
the classification of all isomorphic classes of three-element sets was presented. The 
number of commutative classes is six, and of noncommutative classes is 45. Having 
these and similar results in mind, we can now ask plenty questions. 
Some examples. 
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1) Let us have, for each three-element sets of a group G, \A 2 \ < a, a < 8. All such 
groups were described in [1], [39]. What if \A\ = 4 and \A 2 \ < a, a < 15? 

2) Take one subset of three elements with a given multiplication table. Describe all 
groups without this Latin square. Do the same for a given family of Latin squares. 
Looking at groups in an artistic way, we can think of a group as a complete build- 
ing. The subgroups will then be, say, storeys of this building, and small subsets, 
described with the help of their multiplication tables, as the bricks used to construct 
the building. The kind of material used in the course of construction has an influence 
on the properties of the whole building. We may begin to analyze kinds and numbers 
of different kinds of small subsets which are used in groups of some known kinds, 
say, crystallographic groups. It would be important, though not that simple, to find 
some connection between the properties of such groups and some properties of their 
composition. 

8. Subsets of special elements 

Let us introduce a notion of an (n, m)-special element in a group G. Element a G G 
is called (n,m) special if \K n \ < m < 2™, for all K = {a,g}, g G G, where n,m are 
positive integers m, n > 2. By K n we mean the set of elements g G G having at least 
one representation of the form g = ki, k 2 , . . . , k n . The set of all the (n, m)-special 
elements is denoted by S mjTl (G). The notion of a special element is attributed to J.G. 
Berkovich. In [6], it was shown that 62,3 ((j) and 63,5 (G) are normal subgroups of G. 
Some further results and questions in this direction can be found in [6]. The main 
problem here: Let n be fixed. For which m the set S is a normal subgroup of G, union 
of cosets,...? 

9. Computation complexity 

Results in Integer Programming were obtained mainly between the years 1987 and 
1990, see [9], [10], [21], [25], [26], [27], [30], [31]. The main problem which was studied 
there: Let an equation 

ai%i + a 2 x 2 + h a m x m = b 

be given, where Oj are different large positive integers. Do solutions exist of this 
equation or, perhaps, there are no solutions? The unknowns X{ take the values 
or 1. Direct computation, see, for example [40], gave a solution to the problem for m 
equal to two-three hundred, and Oj of order 10 10 . The use of analytical methods of 
Additive Number Theory enabled us to formulate general conditions when a solution 
and, in fact, many solutions exist. Methods of Inverse Additive Number Theory 
enabled us to build the structure of the set A = ai in the case where there are no 
solutions. Algorithms and programs were built for values of m of order 10 7 allowing 
us to produce computations in a split second (see [10]). A problem for further study 
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is to go from one equation to a system of several equations (see [31] and the thesis of 
Alain Plagne). 
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